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Abstract 



■ Let q be a positve integer, and G be a g-partite simple graph on qn vertices, with 
CS| I n vertices in each vertex class. Let S — jr^^, where kq ^ q + 0{\ogq). If each vertex 

of G is adjacent to at least Sn vertices in each of the other vertex classes, q is bounded 
] and n is large enough, then G has a if^-factor. 

u; 

^ '. 1 Introduction 

In tills paper we will consider simple graphs. We mostly use standard notation: we denote 
by V{F) and E{F) the vertex and the edge set of the graph F, degpix) is the degree of the 
vertex x £ U(-F) and 5{F) is the minimum degree of F. 
^ ■ Let J be a fixed graph on q vertices. If g||U(F)| and F has a subgraph which consists 

, of \V{F)\/q vertex-disjoint copies of J, then we say that F has a J-factor. 

, A fundamental result in extremal graph theory is the following theorem of Hajnal and 

'nI" I Szemeredi [3]: 
I> ■ 

■ Theorem 1 (Hajnal and Szemeredi) Let G be a graph on n vertices such that 6{G) > 
OO , 2^72. If q divides n, then G contains n/q vertex- disjoint cliques of size q. 



q 

The theorem is obvious for q = 2; the first non-trivial case q = 3 was proved by K. 
Corradi and A. Hajnal [1]. The proof for arbitrary q is notoriously hard, it was found by 
A. Hajnal and E. Szemeredi in 1970. 

We say that F is multipartite, if its vertex set can be divided into classes which are 
independent sets. If the number of classes is q, then F is g-partite. F is a balanced q-partite 
graph, if these vertex classes are of the same size. Let F be a g-partite graph with vertex 
classes Ai, A2, ■ ■ ■ , Aq. We define the proportional minimum degree of F by 

xtTP\ ■ ■ (de-g{v,Aj) . 

0[r ) = mm mm J ; : 7 ^ i\. 

It is natural to investigate the multipartite version of Theorem 1: 
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Conjecture 2 Let G be a balanced q-partite graph on qn vertices. There exists a positive 
constant K such that if every vertex is adjacent to at least ^^-^n + K vertices in each of the 
other vertex classes, then G contains n vertex-disjoint cliques of size q. 

Notice the extra additive constant: it turns out that it is necessary to have K for 
odd qs. The conjecture is easily seen to hold for q = 2. It was shown for g = 3 [7] and 
g = 4 [8] . The proofs of these latter cases are very involved. In this paper we show a relaxed 
version. For k being a natural number let hk denote the fcth harmonic number, that is, 
% = l + i + i + ... + i 

Theorem 3 Let q > 3 be an integer and kg = q — 3/2 + hq/2. Then there exists an uq such 
that if n> uq, G is a balanced q-partite graph on qn vertices, and d{G) > jj^, then G has 
a Kg- factor. 

We also have the following corollary of Theorem 3: 

Corollary 4 Let G be as above. Assume that H is a fixed graph such that x{H) < q, and a 
constant number of vertex disjoint copies of H is colored by q colors such that we use every 
color and every color class has size k. If k divides n, then G has an H -factor. 

For proving Theorem 3 our main tools will be the Regularity Lemma of Szemeredi [9], 
and the Blow-up Lemma [4, 5]. We will give a brief survey on the necessary notions in the 
second section. 

2 Main tools for the proof 

Wc introduce some more notation first. For any vertex v of the graph G, degdv, X) is 
the number of neighbors of v in the set X, and e{X, Y) is the number of edges between 
the disjoint sets X and Y. Ng{v) is the set of neighbors of v and Ng{v,X) is the set of 
neighbors of v in X. For a set S C V{G), N{S) = Uv^sN{v). 

If is a natural number, and every vertex in the graph G has degree k, then we call the 
graph /^-regular, or simply regular. On the other hand, for a real e € (0, 1) we will consider 
e-regular pairs, these pairs play a crucial role in the Regularity Lemma of Szemeredi (more 
details follow later). 

Let F be a multipartite graph. Given certain vertex classes Ai-^, . . . ,Ai^ we will denote 
the s-partite subgraph of F spanned by these classes by F{Ai^, . . . ,Ai^). Throughout the 
paper we will apply the relation "<C" : a <C 6, if a is sufficiently smaller, than b. 

2.1 Factors of bipeirtite graphs 

Let F be a bipartite graph with color classes A and B. By the well-known Konig-Hall 

theorem there is a perfect matching in F if and only if |A'^(S')| > |5| for every S C A. The 
following, while simple, is a very useful consequence of this result, we record it here for 
future reference. 

Lemma 5 If F is a balanced bipartite graph on 2n vertices, and deg{x) > n/2 for every 
x G y{F), then there is a perfect matching in F. 
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If / : V(F) N is a function, then an /-factor is a subgraph F' of F such that 
degyiii}) = f(v) for every v £ V. We wih need special /-factors, namely when f = r for 
some r £ N. Then F' is an r-regular subgraph of F. If the minimum degree of F is large 
enough, then one can find a sufficiently dense spanning regular subgraph (see [2]): 

Theorem 6 Let F(A, B) be a balanced bipartite graph on 2n vertices, and assume that 6 = 
5{F)/n > 1/2. Then G has a \j}{5)n\ -regular spanning subgraph, where p{6) = ^+^/MEl_ 

2.2 Regularity Lemma 

The density between disjoint sets X and Y is defined as: 

e{X,Y) 



d{X,Y) = 



\X\\Y\ 



In the proof of Theorem 3, Szemeredi's Regularity Lemma [9, 6] plays a pivotal role. We 
will need the following definition to state the Regularity Lemma. 

Definition 1 (Regularity condition) Let e > 0. ^ pair {A,B) of disjoint vertex-sets in 
G is e-regular if for every X C A and Y C B, satisfying 

\X\ > e\A\, \Y\ > e\B\ 

we have 

\d{X,Y) -d{A,B)\ < e. 

This definition implies that regular pairs are highly uniform bipartite graphs; namely, the 
density of any reasonably large subgraph is almost the same as the density of the regular 
pair. 

We will use the following form of the Regularity Lemma: 

Lemma 7 (Degree Form) For every e > there is an M = M(e) such that ifG= (V, E) 
is any graph and d £ [0, 1] is any real number, then there is a partition of the vertex set 
V into i + 1 clusters Wq, Wi, . . . , Wi, and there is a subgraph G' of G with the following 

properties: 

• i<M, 

• \Wo\<e\V\, 

• all clusters Wi, i>l, are of the same size m ^< [^J < e|V^|), 

• degc'iv) > degciv) — {d-\- e)\V\ for all v £ V, 

• G'\wi = (Wi is an independent set in G' ) for all i > 1, 

• all pairs {Wi, Wj), 1 < i < j < i, are e-regular, each with density either or greater 
than d in G' . 
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Often we call Wq the exceptional cluster. In the rest of the paper we will assume that 
< £< d< 1. 



Definition 2 (Reduced graph) Apply Lemma 7 to the graph G = {V, E) with parameters 
e and d, and denote the clusters of the resulting partition by Wq, Wi, . . . , Wi, Wq being the 
exceptional cluster. We construct a new graph Gr, the reduced graph of G' in the following 
way: The non- exceptional clusters of G' are the vertices of the reduced graph Gr (hence 
\V{Gr)\ = i). We connect two vertices of Gr by an edge if the corresponding two clusters 
form an e -regular pair with density at least d. 

The following corollary is immediate: 

Corollary 8 Apply Lemma 7 with parameters e and d to the graph G = {V, E) satisfying 
^{G) ^ in i\V\ = n) for some 7 > 0. Denote Gr the reduced graph of G' . Then S{Gr) > 
(7 - e)i, where 9 = 2£ + d. 

The lemma below states that the property of being balanced can be inherited by the 
reduced graph. 

Lemma 9 Let G be a balanced multipartite graph, then Gr can be balanced as well. 

Proof: Trivial. □ 

Given an £-regular pair (A,B), we may increase A and B by adding some new vertices 
to both. We expect that after this procedure the new pair will be 77-regular for some small 
Tj, although r) > e. 

Lemma 10 Assume that < £ ^ 1/K. Let {A, B) be an e-regular pair with m=\A\ = \B\, 
and add Kem vertices to A and to B. Then the resulting new pair is 2^/£-regular. 

Proof: Simple computation. □ 

We will need the following simple lemma: 

Lemma 11 Let {A,B) be an e -regular-pair with density d for some e > 0. We arbitrarily 
halve A and B, getting the sets A'., A" and B',B", respectively. Then the following holds: 
{A', B') and (yl", B") are 2£-regular pairs with density at least d — e. 

Proof. Trivial. □ 
A stronger one-sided property of regular pairs is super-regularity: 

Definition 3 (Super-Regularity condition) Given a graph G and two disjoint subsets 
of its vertices A and B, the pair {A, B) is (e, (5)-super-regular, if it is e-regular and further- 
more, 

deg(a) > S\B\, for all a G A, 

and 

deg(b) > 6\A\, for all beB. 
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Let £ > and assume that the pair {A, B) is e-regular with density d. Mark those 
vertices of A which have less than [d — e:)\B\ neighbors and those which have more than 
{d + e)\B\. By the definition of e-regularity, there can be at most 'is\A\ marked vertices in 
A. Repeat the same procedure for B so as to mark those vertices which have too many or 
too few neighbors in A. If we get rid of the marked vertices of A and B then we will have a 
(3e, d — 3e)-super-regular pair {A' , B'). That is, we proved that every regular pair contains 
a large super-regular pair: 

Lemma 12 Let {A, B) he an e-regular pair with density d. Then it has a (3e, d — 3e)-super- 
regular subpair {A',B') where A' C A, \A'\ = \A\ - 2e\A\ and B' C B, \B'\ = \B\ - 2e\B\. 

We will repeatedly make use of the following folklore result, which states that random 
subpairs of (s, (5)-super-regular pairs are likely to be super-regular, with somewhat weaker 
parameters: 

Proposition 13 Let (A, B) he an (e, 5) -super-regular pair with density d and k he a positive 

integer. Assume that \A\ = \B\ = m, and k\m. Divide A and B into k random subsets: 
A = A1UA2U. . .UAk and B = B1UB2U. . -UB^, each having sizem/k. Then with probability 
tending to one as m tends to infinity we have that {Ai,Bj) is an (e' , 6') -super-regular pair 
with density d' for every 1 < i, j < k, where e' <2s, S — e < S' and d — e < d' . 

Let Gr be the reduced graph of the graph G such that edges in Gr represent e-regular 
pairs with density at least d. Assume that Gr is a cluster graph which we get by randomly 
splitting the clusters of Gr into sub-clusters of equal size. The new sub-clusters will be 
called split copies of the original cluster, and we will use to indicate that we refer to a 
split copy. 

Two split copies will be connected if they form an e'-regular pair with density d' where 
e' < 2e and d' > d—e. By the previous proposition if WiVFj G E{Gr) and Wj, Wj arose from 
Wi and Wj by the random splitting, then WiWj G E{Gr)- We will call Gr the refinement 

of Gr. 

2.3 Blow-up Lemma 

Let H and G be two graphs on n vertices. Assume that we want to find an isomorphic copy 
of H in G. In order to achieve this one can apply a very powerful tool, the Blow-up Lemma 
of Komlos, Sarkozy and Szemeredi [4, 5]. 

Theorem 14 (Blow-up Lemma) Given a graph R of order r and positive integers 6, A, 

there exists a positive e = e{S,A,r) such that the following holds: Let ni,n2, . ■ ■ ,nr be 
arbitrary positive parameters and let us replace the vertices vi,V2, ■ ■ ■ ,Vr of R with pairwise 
disjoint sets Wi, W2, ■ ■ ■ , Wr of sizes ni,n2, . . . ,nr (blowing up R). We construct two graphs 
on the same vertex set V = UiWi. The first graph F is obtained by replacing each edge 
ViVj G E(R) with the complete bipartite graph between Wi and Wj. A sparser graph G is 
constructed by replacing each edge ViVj arbitrarily with an (£,6) -super-regular pair between 
Wi and Wj. If a graph H with A (if) is emheddahle into F then it is already emheddable 
into G. 
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3 The first stage of the embedding algorithm 



Since a i^Tg-factor is a subgraph, finding such a factor will be considered as an embedding 
problem. Let us denote the union of n vertex-disjoint copies of KqS by H. We will show 
Theorem 3 by exhibiting a randomized algorithm which with high probability will embed 
H into G. 

The algorithm will proceed as follows: first, apply the Regularity Lemma to G with 
appropriately chosen parameters < e ^ d ^ 1, and get the balanced g-partite reduced 
graph Gr ■ The cluster classes of Gr are denoted by , ^2 , • • • , , here |^i| = |^2| = ••• = 

\A^\=t 

We will proceed in two stages. In the first stage we distribute the vertices of H among 
the non-exceptional clusters of Gr'- we require that (1) if two vertices of H are adjacent, 
then they should be assigned to adjacent clusters of Gr', and (2) about the same number 
of vertices should be assigned to every cluster - the difference cannot be larger than o{n). 
These requirements will be achieved via finding a iCg-factor in a cluster graph Gr , which is 
a refinement of Gr- The recursive algorithm to construct the clique- factor in Gr is called 
the Factor Finder Algorithm. 

Having the above mentioned clique-factor we take any surjcctive function which assigns 
g-cliques of H to (/-cliques of the factor in Gr in such a way that \(j)~^{C)\ = \(f)~^{C')\ for 
every two cliques C,C'. Such a function is obvious to find. Then we assign the vertices of 
the clique C e H to the clusters of (l){C) in the obvious way. This assignment is easily seen 
to satisfy (1) and (2). 

We will finish the embedding in the second stage by the help of the Blow-up Lemma. 
This is a technically somewhat challenging part, however, this stage is more routine. 

The Factor Finder algorithm is a recursive algorithm, with base case q = 2. For an 
easier understanding we will consider the case g = 3 in greater detail, and then generalize 
the method for larger qs. 

3.1 The Factor Finder Algorithm 

Given the graph G, we apply the Degree Form of the Regularity Lemma with parameters 
e and d such that < e <^ d <^ 1. Then we find the reduced graph Gr- We assume that it 
is a balanced (/-partite reduced graph on qi vertices with 5{Gr) > kq/{kq + 1) where kg = 
q — S/2 + hq-i/2. Observe that this is not necessarily the case, since S{G) > kq/{kq + l) does 
not imply the above bound for d{Gr), in general we may lose some edges when discarding 
the irregular pairs and also when putting vertices to Wq (recall Corollary 8). It turns out 
that with a proportional minimum degree this large we will have room to spare in case g > 3. 
Therefore, the algorithm will find the i^^-factor even in case 5{Gr) = kg/ {kg -|- 1) — 7^, where 
7g is a function of q. We will discuss the details at the end. 

In what follows we will denote kg/ {kg + 1) by S, and the cluster classes of Gr will be 
denoted by Ai, A2, . . . , Ag. Recall, that our goal is to show that H C G, where H is the 
disjoint union of n copies of KgS. 

The first case: q = 2 

First, notice that in this case kg = 1, therefore, 6 = 1/2. It is straightforward to find 
a iC2-factor (a perfect matching) in a balanced bipartite graph Gr with a proportional 
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minimum degree this large (Lemma 5). 

Assume, that the proportional minimum degree is a bit larger, it is 1/2 + ^ for some 
< ip < 1. Then one can introduce some randomness in finding the perfect matching. Pick 
tl)i/2 clusters randomly from the first vertex class, and find neighbors for them randomly. 
Then pick ^^/2 clusters randomly from the other vertex class, and find neighbors for them 
randomly. This way we have found random neighbors for ipi clusters. In the leftover the 
minimum degree is sufficiently large for having a perfect matching. Therefore, we can find 
a perfect matching in such a way that clusters have randomly chosen neighbors. 

As it turns out later on, this small extra randomness will be very helpful. When finishing 
the embedding of H wc need a bit larger proportional minimum degree, than 1/2 at the end 
in order to perform this procedure (recall, that we use recursion) , but that will be provided 
for q > 3. 

Finding a triangle factor 

As a warm-up we discuss this case in details. First, apply Theorem 6 for the graphs 
Gr{Ai,A2) and Gr{Ai,A3). We get two /it-regular bipartite graphs R{Ai,A2) and R{Ai,A3), 
with jj, = p{6)£. (Since we can delete as many 1-factors from a regular bipartite graph as 
we please and still get a regular bipartite graph, we may assume that n = p{6)i.) We let R 
to be a 3-partite graph on A1UA2U A3 such that E{R) = E{R{Ai,A2)) U E{R{Ai,A3)). 
It is easy to see, that degji(b) = /x for every b e A2 Li A3. 

We are going to cut the clusters of A2 U A3 randomly into p sub-clusters of equal size. 
The new cluster classes are denoted by A2 and A3. Roughly speaking, we will assign the 
split copies of A2 U A3 to the clusters of Ai, such that every cluster of Ai will receive 2// 
split copies, and every split copy will be assigned to exactly one cluster in Ai. 

More formally, let us define a surjective function a: its domain is the set of split copies, 
and its range is Ai. It satisfies the following requirements: whenever U £ A2U A3, and U 
is a split copy of U, then (t{U) G Nji{U), moreover, if U and U' are different split copies of 
U, then a{U) / cr{U'). For every I^ G ^1 we introduce two sets associated with it: 

N2iW) = {U:UeA2, a(U) = W], 

and 

N3{W) = {[/:[/ G I3, (t{U) = W}. 

It is easy to see, that every cluster of A2 U A3 will participate in one of the NiiyV) sets, and 
\Ni{W)\ = /X for i = 2, 3 and every W £ Ai. 

Our next goal is to show, that Gr{N2{W),N3{W)), the induced subgraph of the refine- 
ment of Gr on N2{W) and N3{W) has a perfect matching M{W) for every W ^ A^. Having 
this perfect matching at hand we can construct p triangles for every W £ Ai: cut the 
clusters of Ai randomly into fi sub-clusters, and assign the split copies of W to the edges 
of M{W) bijectively. This way we construct triangles each having cluster size m//Li. 

Hence, what is left: for every W e Ai find the perfect matchings in the bipartite 
subgraphs Gr{N2(W), N3(W)). We claim that the minimum degree in these bipartite graphs 
is in fact sufficiently large to guarantee the existence of a perfect matching in it. For that 
we will show that every cluster is adjacent to at least half of the clusters in the other class. 
We use a simple claim which we record here for future purposes. 
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Claim 15 Let F = (V, E) he a graph and let S (ZV . Then every u E V is adjacent to at 
least a ^^^^ j)^^ ^^^^ proportion of the vertices of S. 

Proof: Obvious. □ 

Now let U G N2{W) be an arbitrary cluster. By Claim 15 U is adjacent to at least 
{6 — (1 — proportion of the vertices of N^CW). Similarly, every U G N^CW) is 

adjacent to at least {6 —^(1 — /Lt/£))£//i proportion of N2{W). Easy calculation shows that if 
6 = 0.68, then = p{6) = 0.64, and 

5{GriN2iW),N3iW))) > itlStU^lM = o,5, 

This implies the existence of a perfect matching in Gr{N2{W), N^CW)), hence, as we dis- 
cussed above, this can be extended into a triangle factor in the refinement Gr- Notice, that 
0.68 < 0.6923 < ^^^l+hi^i+i ' have found a triangle factor in Gr with a smaller 

bound that is required by Theorem 3. 

This latter fact will be important for us later on. Recall the discussion of case q = 2. 
Obviously, for = 3/2 + /i2/2 the proportional minimum degree will be larger than 1/2 
when it comes to finding the perfect matchings in the Gr{]^2{W), Ns{W)) graphs. Hence, 
we can perform the randomized procedure for finding the perfect matchings. 

As we noted above, having a triangle factor in Gr allows us to find the good pre- 
assignment easily. We remark, that the cluster size in is ^ , and the number of clusters 

is ni = p(5)f. 
The general case 

Assume now that q > 2>. We will apply induction on g, and assume that if the 
proportional minimum degree in a balanced {q — l)-partite cluster graph F is at least 
kq-\/{kq-\ + 1), then F has a iC^-i-factor. 

We are given G,., a balanced g-partite graph with vertex classes Ai, A2, ■ ■ ■ , Ag such that 
^{Gr) ^ kq/{kq + 1). This time our goal will be to find a X^-factor in a refinement Gr- 

Set ji = p{6{Gr))i- We consider the bipartite subgraphs Gr{Ai,Ai) and apply Theorem 6 
to get the //-regular bipartite graphs R{Ai,Ai) for every 2 < i < q. Let i? be a g'-partite 
graph such that V{R) = Ui>iAi and E{R) = \Ji>2E{R{Ai, Ai)). As before, degniU) = p, 
where U e A2U . . .U Aq. 

Similarly to the case q = 3 we randomly split every cluster in A2U A^L) . . .U Aq into p, 
sub-clusters of equal size thereby getting A^ from A^ for 2 < i < q. 

We define a surjcctivc function a: its domain is the set of split copies, and its range is 
Ai. It satisfies the following requirements: whenever U £ A2U . . . L) Aq, and U is a split 
copy of U, then a{U) G Nr{U), moreover, if U and U' are different split copies of U, then 
a{U) / cf{U'). For every 1^ G ^1 we introduce q — 1 sets associated with it: 

Ni{W) = {U:UeAi, a{U) = W} 

for 2 < i < Q". It is easy to see, that every cluster of ^2 U . . . U will participate in one of 
the NiiyV) sets, and |A^i(VF)| = p for every 2 < i < q and every W G Ai. 
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Let us consider the balanced {q — l)-partite graphs Gr{N2{W), . . . , Nq{W)) for every 
W £ Ai. As before, we can lower bound the proportional minimum degree in these graphs 
by the help of Claim 15: 

S{Gr{N,{W), N,iW))) > 

In case q = 3 we had to check whether this quantity was at least 1/2, this time wc have to 
check that this number is sufficiently large so as to guarantee the existence of a Kq_i-factor 
in these graphs. 

Say, that we can find a ETg-i-factor M{W) for every W € Ai. Then we construct the 
desired i^^-factor in the following way: cut the clusters of Ai randomly into fi sub-clusters, 
and assign the split copies of W to the (g — l)-cliques of M{W) bijectively. This way we 
get n cliques of size q each having clusters of size m//x. 

In Lemma 16 below we will prove that S > ^-+1 is sufficiently large, that is, the pro- 
portional minimum degree in S{Gr{N2{W), . . .,Nq{W))) is at least j^^^, which, by the 
induction hypothesis implies the existence of a iTq-i-factor in G^(^2) ■ ■ ■ > ^g) for every 
l<j<i. 



kg-l 



Lemma 16 

SiGr{N2{W),...,Nq{W)))> 
for every l<j<iifq'>3. 
Proof: By Claim 15 we have that 

~5{Gr{N2{W), Nq{W))) > 

Since 

kg I kq — 1 

kq _ fc,+l + V fc,+l 
^^kq + l> 2 

we get the following lower bound for the proportional minimum degree in Gr{N2{a), . . . , Nq{a)): 

~ 2 



, 2(^5+1) ^ 2 V kq+1 _ ^ kg+1 _^ 



kq 



kq + l 

We will show that 

kq-l 



kg + ^ikq-l)ikg+l) ^ kg-l + 1 kg^l + 1 



is a valid inequality. Equivalently, we claim that 
1 > ' = ^ . ' 

kg + J{kg-l){kg + l) kg-^ + I kg-^ + I fe, + . / ( A^, - 1 ) (fc, + 1 ) 
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This is imphed by 



Since kq = kq-i + 1 + l/{2q 

which is easily seen to hold for g > 3. □ 

We have proved that the Factor Finder algorithm can construct a ivTg-factor in Gr 
provided that S{Gr) > kg/ {kg + 1). Observe that apart from the case q = 2 we have room 
to spare in the proportional minimum degree. That is, the algorithm will complete its task 
successfully even in case 6{Gr) = kq/{kq + 1) — 7g if g > 3 and 7^ is sufficiently small. Let 
us choose e and d such that < e <^ d <^ jq, and apply the Regularity Lemma. With this 
choice, by Corollary 8, 5{Gr) > kq/{kq + 1) — 7g. Hence, if e and d are sufficiently small 
then the Factor Finder algorithm will find the clique factor. 

We remark that the bound oi kq = q — 3/2 + /iq_i/2 could be improved somewhat. We 
didn't want to optimize on this bound. It already gives the correct order of magnitude for 
our embedding method: kq = q + O(logg), without having tedious computations in the 
proof of the lemma. 

More on the Factor Finder algorithm 

Let us explore more properties of the Factor Finder algorithm, which will be useful later 

on. Set si{q) = £ for every q > 3. Given a cluster W G Ai wc denote its degree in R{Ai, Aj) 
by S2{q), that is, S2{q) = p{5{Gr))L The recursive process guarantees that we can construct 
a iCq-i-factor in the g — 1 neighborhoods of W, each having size S2{q)- Now for finding 
the ifg-i-factor we again apply recursion, and want to find a Kq-2 factor in S2{q) different 
balanced {q — 2)-partite graphs. The size of the vertex classes of these balanced graphs 
will be denoted by s^{q). In general, when proceeding with the recursion, step-by-step we 
construct balanced (g — i)-partite graphs, in which we look for a Kg-j-factor. The number 
of these graphs is • S2{q) - ■ ■ Si{q). The number of clusters in a class of these balanced 
graphs are denoted by Si{q). We stop i = q — 1, when we arrive to balanced bipartite graphs, 
in which wc arc looking for perfect matchings. 

We can compute the number of cliques in the i^g-factor which contain some split copy 
of a given cluster. 

Lemma 17 Let U be an arbitrary cluster in Gr- The split copies ofU appear in IifZ2^i{q) 
cliques in the Kg- factor of Gr- 

Proof: We want to apply induction, but for doing that we have to be careful. The statement 
we will prove by induction is as follows: 

Claim: Let F be a balanced a-partite cluster graph with cluster classes of size £, and W be 
a cluster of F- If 5{F) > kj/ {kj + 1) where j > a, and we apply the Factor Finder algorithm 
then the number of a-cliques containing a split copy of W is Il^~2Si{j). 



kq + kq^l -^> 2(Vi + !)• 
— 2), the above is a consequence of 
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It is easy to see that this statement is stronger than that of the lemma. Notice, that we 
have to keep track of the size of the cluster classes, too. 

We show that in case a = 3 the above statement holds. Let j > 3. First assume that 
U e Ai. The algorithm finds the neighborhoods N2{U) C A2 and A^3(C/) C A3, both having 
size S2{j)- Next we look for a perfect matching between these two sets, every edge of this 
matching with U will result in a triangle. Hence, the number of triangles having a split 
copy of U is S2{j). 

Suppose, that U G A2, and let W G Nii{U, Ai) be arbitrary. Then there will be triangle 
which contains a split copy of W and a split copy of U. Since this holds for every cluster 
of Nii{U, Ai), and this set has S2(i) clusters, there are S2(i) triangles which contain a split 
copy of U. 

Assume now that a > 3 and that the induction hypothesis holds up to a — 1. Let j > a. 
As above, we begin with the case U e Ai. The algorithm first finds an (a — l)-partite 
cluster graph in which every cluster class has size S2{j), and U is adjacent to every cluster 
of this graph. We want to find a i^a_i-factor in some refinement of it by the Factor Finder 
algorithm. Let W be an arbitrary cluster from the "first" cluster class of the a — 1 classes. 
We have S2{j) possible choices for W. The following is easy to see: for 1 < i < a — 2 the 
cluster classes of the (a — i)-partite graphs constructed by the Factor Finder algorithm will 
be of size Si+i(j). Hence, applying the induction hypothesis, there are H"~2^Sj_|_i(j) cliques 
on a — 1 clusters which contain a split copy of W. We have S2{j) choices for W, therefore, 
the number of g-cliques containing a split copy of U is S2(j)n"~|sj_)_i(j) = H"~2^ Sj(j). 

Finally, we consider the case a > 3 when U € At for t > 1. In the first step there are S2(j) 
clusters of Ai such that these arc adjacent to U in R{Ai, At). Let W be any of these clusters. 
Consider the (a — l)-partite cluster graph which is constructed for W by the algorithm. This 
cluster graph has classes of size S2(j)- As above, we can apply induction, and get that the 
algorithm finds H"~2 cliques on a — 1 clusters which contain a split copy of U. We 

repeat this for every cluster in Nfi{U, Ai), that results in S2(i) different (a — l)-partite 
graphs. In each of these we find n"~|sj+i(j) cliques on a — 1 clusters containing a split 
copy of U. Overall, split copies of U appear in S2{j)IliZ2 ^i+iU) = ^iZ2^i{j) cliques on a 
clusters. □ 

Obviously, si{q) > S2{q) > 53(9) > ■■■Sq-i{q) > for g > 3. The last inequality 
follows from Claim 15 and the fact that the proportional minimum degree in the last graph 
is > 1/2 + V'g for some positive constant depending only on q. (Recall that k^/{k^ + 
1) — 0.68 > 0.01, hence, V'3 > 0.01, and because of Lemma 16 the property of V'g being 
positive is inherited for larger values of q.) Observe, that the overall number of cliques in 
the i^q-factor is ^1^iSi{q) = Vq&^^, where Vq is a constant. This implies, that the cluster 
size in is m = n^^2^j(g)m, where fh is the common cluster size in the refinement Gr, 
and the number of clusters in Gr i = Ti1zlsi{q) = Uql'^~^. 

4 Second stage — Finishing the proof of Theorem 3 

In this section we discuss how to finish the embedding of H into G. Observe, that by 
applying Lemma 12, Proposition 13 and the Blow-up Lemma we are able to embed most of 
H into G: The edges in the cliques of the Kq factor of Gr represent e'-regular pairs, which 
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by Lemma 12 can be made super regular. Applying the Blow-up lemma we get that most 
of H can be embedded into G, at most Se'n + |Wo| < ^e'n vertices are left out, here e' is a 
constant multiple of e. 

Our main goal in this section is to embed the whole of H by the help of the Blow-up 
Lemma. For that we will try to find a ii'^-factor in such a way that every edge in the cliques 
will represent (77, d — ry)-super-regular pairs, where e will be a function of rj. Moreover, every 
vertex of G will sit in a cluster of some clique, and every cluster will have the same size. 
We will achieve this goal in a few steps. First we discard those vertices from the cliques 
which do not have many neighbors in other clusters of the cliques, and put them to Wq, the 
exceptional cluster. Secondly, we will distribute the vertices of Wq such that every edge in 
the Kq-iactor will represent a super-regular pair. Finally, we move vertices between clusters 
so as to get equal size clusters in the cliques, but keep super-regularity, we call this the 
balancing step. Then we will apply the Blow-up Lemma. 

We need an important lemma, which will be crucial for making the cluster sizes equal 
in every clique. In order to state it, let us define q directed graphs: Li,L2, . . . ,Lq. Here 
V{Li) = Ai, the class containing the split copies of the clusters of the zth class. Let 
Ui,U2 G Ai, we will have the directed edge {Ui,U2) € E{Li), if C/i is adjacent to all the 
clusters of the g-clique which contains U2 except U2 itself. That is, if W is a cluster of this 
clique, then the {Ui,W) pair is e'-regular. We will also say that Ui is adjacent to the clique 
of U2- We will show the following: 

Lemma 18 Let Ui,U2 G Ai for some 1 < i < q, and let Ui be any split copy of Ui in Gr- 
Then with probability at least 1 — l/{2q^l^) there are more than |Vg*g-i(9)n^=3 ^1(9) 
copies of U2 such that Ui is adjacent to its clique. 

The main message of Lemma 18 is that out of the Ilil2Si{q) cliques in the factor which 
contain some split copy of U2 a constant proportion is adjacent to some split copy oi Ui, 

independently of the choice of C/i and U2. 

Proof: We will follow the line of arguments of the proof of Lemma 17. The extra cluster 
Ui can be considered as having one more cluster class. More precisely, the effect of having 
Ui is as follows. When computing the number of cliques having a split copy U2, at every 
step we have to take into account whether the clusters are in the neighborhood of Ui. This 
shrinks the sizes: if the cluster class size in question is Si{q), then out of this many clusters 
at least Si+i(g) is adjacent to Ui. 

This estimation works smoothly until at the end we have to find a perfect matching 
in a bipartite graph having cluster classes of size Sq_i(g) each. Then Ui is adjacent to at 
least (1/2 + 'ipq)sq-i{q) clusters in both classes. Recall that we find the perfect matching in 
the following way: We randomly, independently, with probability choose clusters, and 
pick a random vacant neighbor for those. The rest can get a neighbor by any algorithm 
for finding a perfect matching. Suppose that we choose W for having a random neighbor. 
Since W and Ui have a common neighborhood of size at least ipqSq-i{q), the probability 
that W will get a neighbor in the perfect matching which is adjacent to Ui is at least 
ipq. The expected number of cliques containing some split copy U2 and being adjacent to 
Ui is at least j'il^qSq-i{q)Il1~^Si{q), here we applied the bound of Lemma 17. Standard 
probabilistic reasoning - use e.g., Azuma's inequality - shows that Ui will be adjacent to 
at least ^ipqSq-i{q)IliZl Si{q) split copies of U2 with probability at least 1 - l/{2q'^£^). □ 
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Observe, that if U ,U' are split copies of U, then U is adjacent to the chque of U' . 
Together with the so called union bound in probability theory this implies the following: 

Corollary 19 With positive probability there are at least ^ipgSq-i{q)Il'-Z^Si{q) vertex dis- 
joint directed paths of length at most two between any two clusters in Lj, for every 1 < i < q. 

We have acquired the knowledge to achieve our main goal, in the rest of the section we 
discuss how to finish the embedding step by step. 

In the first step we make every edge in the cliques of the factor super-regular by applying 
Lemma 12, the discarded vertices will be put to Wq. Then the enlarged extremal cluster Wq 
will be larger, but still remain reasonably small: |Wo| < s^'^^j where e' is a constant multiple 
of e. 

In the second step we will distribute the vertices of Wq among the £ clusters of Gr- Let 
V £ Wq and C7 be a cluster. We say that v is adjacent to the clique of U if v has at least 
din neighbors in every cluster in the clique of U, except in U itself. Notice, that the proof 
of Lemma 18 shows, that for every v G Wq there are at least ^'ijjq£sq-i{q)Ilf~^ Si{q) clusters 
such that V is adjacent to their cliques. Since the number of cliques is £ = IlfZlsi{q), every 
vertex is adjacent to Cq£ cliques, where Cq = Sq-i{q)'tpg/{8s2{q))- 

When distributing the vertices of Wq we are allowed to put a vertex v to a cluster Uifv 
is adjacent to the clique of U. We pay attention to distribute the vertices evenly, that is, at 
the end no cluster will get more than |lVo|/(cq-^) new vertices from Wq. Since every vertex 
is adjacent to many cliques, this can be achieved. After this step every edge of every clique 
in the ii'^-factor will represent super-regular pairs. 

It is possible, that the clusters have different sizes in a clique, hence, we have to perform 
the balancing algorithm. For that we partition the clusters of Gr into three sets: S<^,S= 
and 5>. S'< contains those clusters which have less than n/£ vertices, S'> contains those 
clusters which have more than n/£ vertices, and S= contains the rest with equality. We will 
apply Corollary 19 in order to find directed paths from clusters of S'> to clusters in S'<. 

Say, that C/i G >S'>, C/2 £ 'S'< and there is a path of length one between them, that is, 
U1U2 € E{Li) for some \ <i < q. Then the vast majority of the vertices of U\ are adjacent 
to the clique of C/2. Pick as many as needed (and possible) among these and place them to 
U2- If the path is of length two, then choose a cluster such that UiU^ and U2,U2 belong 
to E{Li). Again, the vast majority of the vertices in U\ are adjacent to the clique of C/3 and 
the vast majority of the vertices of C/3 are adjacent to the clique of U2- Hence, by placing 
vertices from Ui to C/3 and the same number of vertices from to U2 we can decrease the 
discrepancy of Ui and U2 such that we keep the edges super-regular in all the cliques in 
question. Observe, that we can perform the balancing algorithm such that we do not take 
out more than |iyo|/(cg^) vertices from any of the clusters, and do not put in more than 
|Wo|/(cq^) vertices to any of the cluster. 

We can apply Lemma 10, and get that the edges of the cliques represent (e, d)-super- 
regular pairs, where e < Cy/e and d > d — e, and C is a constant. 

At this point we can recognize, that with positive probability all conditions of Lemma 14 
are satisfied if e is suflficiently small and e <^ d <^ 1. Prom this the proof of Theorem 3 
follows. 
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Proof of Corollary 4: We can embed vertex disjoint copies of H as follows: first, find a 
Kq-ia,cioT in G. Then color some vertex disjoint union of copies oi H hy q colors such that 
every color class has size k and every color is used. Call this colored graph H. It is easy to 
see that G has an iJ-factor: we embed the copies of H in the cliques of the if^-factor by 
the help of the Blow-up Lemma. □ 
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